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We address the question of distance record-setting by a random walker in the presence of measurement error, 
5, and additive noise, 7 and show that the mean number of (upper) records up to n steps still grows universally 



as {Rn) 



,1/2 



for large n for all jump distributions, including Levy flights, and for all 5 and 7. In contrast to 



the universal growth exponent of 1/2, the pace of record setting, measured by the pre-factor of n^''^, depends 
on 5 and 7. In the absence of noise (7 = 0), the pre-factor S[5) is evaluated explicitly for arbitrary jump 
distributions and it decreases monotonically with increasing 5 whereas, in case of perfect measurement (5 = 0), 
the corresponding pre-factor T(7) increases with 7. Our analytical results are supported by extensive numerical 
simulations and qualitatively similar results are found in two and three dimensions. 
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An upper record (record, for short) occurs at step n in 
a time series if the n-th entry exceeds all previous entries. 
The statistics of record breaking events in a discrete- time se- 
ries with independent and identically distributed (i.i.d) entries 
have been studied in statistics and mathematics literature for 
a long time Hl-lstl- Recent years have seen a resurgence of 
interest in record statistics, which play a major role in the 
analysis of time series in a number of diverse contexts, in- 
cluding sports b iological evolution models lH,!^, the- 
ory of spin-glasses OKXll ih . rnodels of growing networks 1 12 1, 
analysis of climate data llsMlTll . and quantum chaos jlSll . 
The quantity of central interest is the mean number of records 
{Rn) up to Step n. For a time series with i.i.d entries, a striking 
universal result is that (i?n) ^ Inn for large n IH], indepen- 
dent of the distribution of the individual entries. However, this 
universal logarithmic growth breaks down when the entries of 
the time series are strongly correlated, the simplest example 
being the case of a random walk where the entries of the time 
series represent the positions of the walker at discrete time 
steps. 

While the subject of random walks has an enormous range 
of applications well beyond the original context of diffusion 
and Brownian motion, its exploration in terms of record set- 
ting is relatively recent. The basic question is: how often does 
a random walker, moving in continuous space by jumping 
a random distance at each discrete time step, set a distance 
record, i.e., advance farther from the origin than at all prior 
steps? In other words, how does the mean number of such 
record-setting events grow with the number of steps? This is a 
very natural question in many different contexts, such as in the 
evolution of stock prices lfl9il20ll or in queueing theory |21 1. 



In the one-dimensional case, with pure diffusion but no over- 
all drift, a universally valid result was found in [22] for the 
mean of the upper record-setting events {Rn), namely, that 
it scales as {2/ ^) v}/"^ for large n, where n is the number 
of steps, regardless of the length distribution of jumps (e.g., 
holds even for Levy flights). This square root growth of the 
mean record number was also found numerically in two and 



three dimensions, and considering a drift, an abrupt shift in 
the scaling exponent from 1/2 to 1 was identified fi23i] . Exact 
analytical results were also found in one dimension for a ran- 
dom walker with arbitrary drift | iH 25 1, for a continuous time 
random walker ll26ll and for multiple random walkers ll27ll . In 
the latter case, the theoretical results were in good agreement 
with an analysis of multiple stocks from the Standard & Poors 
500 index lH. 

However, to apply the above results to interpretation of real 
experiments, one needs to re-examine the notion of a record 
because the phrase "advance farther from the origin than at 
all prior time steps" requires closer examination. Why? Be- 
cause all real measurements involve instrument error, 5, and 
noise, 7, is unavoidable. For instance, 6 can be the assur- 
ance hmit of the detector while 7 can describe white noise 
from an instrument reading. Ties become possible because 
of the "fuzziness", as discussed, for example, in 
Hence, the question arises: how does the presence of mea- 
surement error 5 or noise 7 affect the growth of record num- 
ber and the associated record-setting pace? Related ques- 
tions were raised in the statistics literature, e.g., in terms of 
(5-exceedance records llsoi 31 1 and very recently in the physics 



literature 12911 . but asymptotic results are available only for 
time series with i.i.d entries. To the best of our knowledge, 
the question has never been raised in the context of corre- 
lated entries such as random walks. For example, does the 
{Rn) ^ ri}!'^ scaling persist despite the presence of (5 or 7 
and for various jump length distributions? If so, how is the 
pre-factor affected? We address these questions here using 
exact calculations and detailed Monte Carlo simulations. As 
a preview, our major finding is the decoupling between the 
growth exponent which remains universal and the pre-factor 
which carries the "burden" of finite precision and noise. 

Rather than working with absolute values, we define a "one- 
sided" record (only positive maxima are considered) so that 
the i-th entry in a time series, xi, is a record-breaking event 
(record, for short) if it exceeds all previous values in the se- 
quence, i.e., if Xi > max {xi,X2, ■ ■ ■ , Xi^i). We shall hence- 
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forth interpret the distance of the random walker from 
the origin at the i-th time step. However, because of the pres- 
ence of (a fixed) measurement error, 6, we shall now define 
Xi to be a record-breaking event ((5-record, for short) only if 
it exceeds all previous values in the sequence by, at least, 6. 
Similarly, accounting for measurement noise, Xi is a record- 
breaking event if, with the addition of 7 (white noise), it ex- 
ceeds all previous values in the sequence. A subtlety is that in 
the presence of error, a record can be defined as being larger 
- by the amount of the error - than the last record, or than 
the last maximum, the two being identical in the absence of 
error. In the analysis below, we enumerate records larger than 
the previous maximum, as it is more amenable to theoretical 
development as we show below. 

We focus first on the influence of measurement error S. 
Consider a discrete-time sequence {xq = 0,xi,X2, ■ ■ ■ 
representing the position of a one-dimensional random walker 
starting at the origin xq = 0. The position x„-i at step m is 
a continuous stochastic variable that evolves via the Markov 
rule, Xm = Xm-i + 11m where 77^ represents the jump at step 
TO. The noise variables ?7m's are independent and identically 
distributed random variables, each drawn from a symmetric 
and continuous jump density /(r;). Note that although ?7m's 
are uncorrected, correlated random variables. We 

are interested in the statistics of the number of records i?„ up 
to step n. A record occurs at step m if Xm — 5 > Xk for all 
A: = 0, 1, 2, . . . , (m — 1) where (5 > represents the measure- 
ment error For 5 = Q, the statistics of i?„ are known to be 
universal, i.e., independent of the jump density f{r]) ll22ll . For 
instance, the mean record number up to step n is given 
by the expression 1122 1 



(i?„) = (2n+l)(^''|2- 
n 



rl/2 



,1/2 



(1) 



We now proceed to examine how is affected by the 
measurement error 5. Define an indicator a„i = {1,0} at 
each step to so that (Jm = 1 if a record occurs at step to 
and is otherwise. We call = a record, i.e., (Tq = 1. 
Then evidently the number of records i?„ up to step n in a 
given sequence is _R„ ~ X]m=o''^™- Next, we average this 
expression over different histories. Because fT,„ is a binary 
{1, 0} variable, its average {am) is just the probability that a 
record occurs at step m. Hence, 



{Rn 



(2) 
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where rm{5) denotes the record rate, i.e., the probabiUty that 
a record occurs at step m. By definition, rg = 1. Hence, 



^{5) = Prob [xm - (5 > max [0, xi,X2,. 



, Xr, 



(3) 



Thus, rm{5) is the probability of the event that the random 
walker, starting at the origin, reaches a-,„ at step to, while 
staying below — <5 at all intermediate steps between and 



TO, where one needs to finally integrate over all Xm > To 
compute this probability, it is convenient to change variables 
Vk = ^m — Xm-k^ i-C-- obscrvc the sequence {yk} with respect 
to the last position and measure time backwards. Then, r„i{d) 
is the probability that the new walker yk, starting at the new 
origin at fc = 0, makes a jump > S at the first step and then 
subsequently up to to steps stays above 6, i.e., 

r^{S) = Prob [yi>S,y2>5,..., y„, > S\yo - 0] . (4) 



To compute the probability r„j (S) in (|4]i, we note that in the 
first step, the walker jumps to yi = z + 5 from j/o = where 
z > and subsequently up to (to — 1) steps it stays above the 
level 6. Writing yk ~ Zk + S, we can re-express r„i((5) as 



f {z + 5) qm-i{z)dz 



(5) 



where qn{z) is the probability that a random walker, starting 
initially at z, stays positive up to n steps. This persistence 
probability qn{z) has been thoroughly studied in the literature 
for random walks (for a review, see 113 2t ) with arbitrary jump 
density f{ri) and a general expression for its Laplace trans- 
form is known as the Pollaczek-Spitzer formula [|33, 34 1. It 
states that 



1*00 -1 



(s,A) (6) 



where 



(s, A) = exp 



;^ roohlil^ sj{k) 



A2 + A:2 



dk 



(7) 



and /(fc) ~ f{r]) e*'^'' di] is the Fourier transform of the 
jump density /(r;). Note that when 5 — > 0, the integral in ^ is 
just(7„i(0). Thusr„j(0) = qm(0)- From the Pollaczek-Spitzer 
formula in (|6]l, one can show 13211 that J2m=o1"i-W^''"^ ~ 
~ s, independent of the jum p de nsity. This is the cel- 
ebrated Sparre Andersen theorem OSll and when inverted it 
simply gives q„i (0) = (^™) 2"^™. When substituted back in 



(|2|i, it then provides the universal result ||22|| in dl} 



However, we are interested in the case of (5 > 0. To com- 
pute rm (S) for large to in Q, we need to know the large m 
behavior of qm{z) for a fixed z > 0. This can be extracted 
by analyzing Q near s = 1. One finds that the leading order 
behavior of the right hand side of (|6) near s = 1 is simply 
[0(1, A)/A](l — s)^i/2. This means that qn{z) for large n, 
with fixed z, must behave like g„(z) « h{z) / ^ir n. Substitut- 
ing this on the left side of (|6]l and analyzing the leading behav- 
ior near s = 1 shows that the left hand side of (|6]l, near s = 1, 
behaves as h{X){l — s)^^/^, where h{X) = h{z)e^^^dz 
is the Laplace transform of h{z). Comparing the left and right 
sides of (|6]l, we obtain, for large n 
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tion, f{r]) = e-'''/2T^/727rcr2, one finds that 



(Z„(z)«-^ with h{X)= f h{z)e-^'dz = ^<l>{l,X) 
y/nn Jq a 

(8) 

where 0(1, A) can be read off (|6]l as 

^ .-ln(l-/(fc) 



(1, A) = cxp 



A2 +fc2 



dk 



(9) 



Substituting the asymptotic behavior of qn{z) from dH) in Q, 
we obtain, for large m. 



f/((5) = / dzf{z + 5)h{z). (10) 



Finally, substituting this asymptotic behavior of the record 
rate rm{S) in Eq. ^ and performing the sum for large n, we 
find that the mean number of records for large n is given by 



(Rn) 



,1/2 



2 r°° 
, S{6) = ^ / /( 
V'^ Jo 



z + 5)h{z)dz . 

(11) 

This is the main exact result: for an arbitrary jump density 
f{r]), the mean record number grows universally as n^^^ for 
large n (as in the 5 = case), while the pre-factor S{5) de- 
pends on S and does so non-universally insofar as its expres- 
sion depends explicitly on the jump density /(r/). 

Although we have an exact expression for S{5) for arbi- 
trary /(ry), its exphcit evaluation for all 5 is difficult. For 
instance, to compute it explicitly for arbitrary jump density 
f{ri), we need to first compute its Fourier transform /(fc), 
evaluate (/)(1, A)/A from (|9|l, then invert the Laplace transform 
dH) to obtain h{z) and finally perform the integral in (ITTl to 
determine the amplitude S{6). 

For the special case of an exponential jump density, f{ri) = 
(6/2) exp[— 6 I77I], it is possible to evaluate the pre-factor S{6). 
Here, /(fc) = h"^ /{h"^ + fc^); substituting this in the expression 
of 0(1, A) and performing the integral yields 0(1, A) = (6 + 
A)/A. Hence, h{\) = {b + \)/\'^. This Laplace transform can 
be readily inverted to give h{z) = 1 + bz. Using this explicit 
form of h{z) in the expression for S{S) in ( fTTT ) and performing 
the integral yields an exact expression for the pre-factor, valid 
for all (5 > 



S(5) = — cxp [-55] 
\/7r 



(12) 



Note that when S 
2/v^. 



0, one recovers the universal pre-factor 



Consider next a jump density, /(jy), whose tail decays as 
f{ri) ^ exp[— I?;!"] for large 77, where a > 0. Substitut- 
ing this in the expression for S{6) in (fTTI) . expanding for 
large 5 and using h{0) = 1, one can show that for large 6, 
S{S) ^ iji"" e~*". For example, for the Gaussian distribu- 



SiS) 



(5->oo TT S 



(13) 



Finally, consider jump densities with power law tails. 



fiv) 



'1 for large 77 with /i > 0. For Levy flights. 



< /i < 2, whereas for jump densities with a finite variance, 
/i > 2. The Fourier transform, f{k), for small k, generically 
behaves as 



fik) 



1 - \ak[ 



Oik'). 



(14) 



In this case, for large 6 the dominant contribution to the in- 
tegral S{S) = (2/0r) h{z)f{z + S)dz comes from the 
large z region. For large z, one can show that h{z) has the 
asymptotic behavior 

h(z) w — j:—^ r^z''/' for u < 2 (15) 

— z for/i>2. (16) 



Now, consider 5(5) = (2/v^) f{z + S) h{z) dz. We first 
rescale z = Sy. This gives S{6) = {2/^/tt)S /q°° /(<5(y + 
1)) h{y6) dy. For large S, we use (fT6b to obtain. 



SiS) 



6^00 



(17) 



where a = /i/2 for fi < 2 and a ~ 1 for /i > 2. Thus, in this 
case S'(5) decays as a power law for large 6. 

To test these analytical predictions we have performed 
Monte Carlo simulations for the three jump densities: (i) 
/(77) = (1/2) exp[-|77|] (Exponential); (ii) f{r,) = 
(l/-\/27r) cxp[— 772/2] (Gaussian), and (iii) /(7y) drawn from 
a Levy distribution with Levy exponent ^ ~ 1. The Levy 
random number was generated using the method of ll36ll and 
1 37|] . While (i) and (ii) represent normal Fickian diffusion, the 
Levy case represents non-Fickian (anomalous) diffusion; the 
latter can arise in diverse heterogeneous domains such as cells 
1381. cold atoms li39ll . and disordered porous media 140, 41 1. 

Our simulations are conducted with an ensemble of inde- 
pendent random walkers, each entering the one-dimensional 
system at the origin, and with the jump length at each step 
drawn independently from a given pdf. In every simulation, 
5000 particles take 10^ steps each. In all cases, the particle 
is moved from step to step according to its actual (sampled) 
location, without including the value of 5; 5 is added as a 
fixed fraction of the mean (median, for the Levy pdf) jump 
length. At each step, the location of the particle is calculated 
and the current distance value must exceed the last maximum 
by at least the measurement error 6 to qualify as a new 6- 
record and be counted; otherwise we ignore it. The simula- 
tions confirm the 71^/2 scaling for the growth of mean num- 
ber of 5-records, for all values of S. Furthermore, the three 
analytical predictions for S{S) in (fT2] i. (T3[ and (fTTI i are com- 
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pared to Monte Carlo simulations in Fig. [T] the agreement 
is excellent. The pre-factor S{6) decreases from its universal 
value 5'(0) = as i5 increases, so that fewer records are 

counted as the error increases. It is seen that the decrease in 
S{S) is steepest for the Gaussian pdf and has a much slower 
decay for the Levy pdf, in complete agreement with theory. 
The slowing down in the Levy case is due to the anomalously 
skewed nature of the pdf, with frequent small jumps and rare 
but enormous leaps; as a consequence, potential records set by 
small jumps are more prone to being eliminated by the 6 error 
In contrast, the Gaussian case with a ~ 1 displays a rapid de- 
cline with the increasing error, due to the compactness of the 
pdf, so that large jumps are quite rare and record events larger 
than the error are rarer yet. 

We now proceed to examine the influence of the measure- 
ment noise 7. Let {xq = 0, xi , 2:2, . . . , a;„} represent the suc- 
cessive positions of the random walker In this case, a record 
is registered at step m if 

Xm + A/'(0,7) Ax > max(0,a;o,a;i, . . . ,Xm-i) (18) 

where J\f{0, 7) is a zero-mean Gaussian random variable with 
a standard deviation 7. The characteristic magnitude of the 
jump length. Ax, is chosen as Aa; = 1 for the exponential pdf 
(i) and Ax = a ~ \ for the Gaussian pdf (ii); for the Levy pdf 
(iii), Aa; is the median of the one-sided Levy distribution with 
II — 1. The term A/'(0, 7) Aa; in ( fTSb mimics the measurement 
noise. The noise is added for the purpose of record verification 
at each step and is not accumulated to the actual sequence. An 
analytical treatment analogous to that for 5 is not yet available 
and we resort to numerical experiments, similar to those for 5, 
with the results shown in Fig. |2] 

While the scaling ~ ^(7) n^^"^ for large n persists, 

in stark contrast to the S{5), the pre-factor r(7) shown in 
Fig. |2]is an increasing function of 7 for all jump densities. 
Thus for 7-records, the noise adds spuriously to the record- 
setting events, leading to false accounting of records and ren- 
dering an apparent larger than the actual one. This spu- 
riously large rate of record formation increases with the mag- 
nitude of the noise and suggests that it might be possible to 
infer "signal-to-noise" ratio in diffusion-type experiments by 
means of record counting. 

For 7-records, the universality of record-setting affords the 
opportunity to estimate, a priori, the magnitude of 7 in an ex- 
periment involving an ensemble of measurements. At least 
in principle, one first determines from an experiment the pdf 
of the jump lengths in the domain. This pdf can then be em- 
ployed in random walk simulations, as shown above, to gen- 
erate a curve for the pre-factor T(7) (such as seen in Fig. |2l). 
Returning then to an ensemble of experimental measurements 
in the real system, one determines T and then reads off the 
corresponding value of 7 from the simulated T{"i) curve. This 
may provide a practical and simple algorithm to estimate the 
measurement noise 7 in a given experimental setup. 

The results presented here illustrate the subtlety and rich- 
ness of record-breaking and counting, in the presence of in- 



strumental error 5 and measurement noise 7, in systems where 
the underlying process can be modelled by a random walk. 
The decoupling of the growth exponent (1/2, regardless of 
precision and noise) from the pre-factor (which depends on 
instrumental precision and noise in a monotonic, contrasting, 
and pdf-dependent manner) is significant. While the univer- 
sality of the mean record number persists, (i?„) ~ n^/"^, the 
magnitude of the pre-factor is sensitive to the presence of 5 or 
7. Such sensitivity can, perhaps, be exploited in real experi- 
ments to infer instrumental uncertainty and noise from record- 
counting data. 

Finally, we note that all the above Monte Carlo simula- 
tions were also performed on 2d and 3d orthogonal lattices. 
The universality of the n^/"^ record-setting scaling is robust 
for all dimensions, and in all cases, the pre-factors displayed 
qualitative behaviors similar to those shown in Figs. [T] and 
|2] Moreover, Monte Carlo simulations accounting for two- 
sided records (absolute distance) demonstrated the same v}/"^ 
and similar qualitative behavior for the dependence of the pre- 
factors on 5 and 7. 
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FIG. 1: One-dimensional pre-factor S{S) versus measurement er- 
ror S with Gaussian (stars), exponential (squares; 6 = 1) and Levy 
(circles; fi = 1) jump length pdf's. The curves (dotted-dashed, 
dashed, solid) are the corresponding analytical results from il3\ , 
il2i and ( 117b with, respectively, functional forms ^ exp[— <5^/2], 
(2/7r^/^) exp(-5), and 0.69 In the Levy case, /i = 1, hence 

a = /i/2 = 1/2, and the theoretical prediction ~ 6^^^^ in Eq. (TT} 
is consistent with simulations. 




FIG. 2: Pre-factor T{'y) as a function of the measurement noise 7 for 
jump lengths (in one dimension) with Gaussian (stars), exponential 
(squares; b = 1) and Levy (circles; jj, — 1) pdf's. The curves repre- 
sent quadratic fits of functional form a + 67^ with different values of 
a, b. 



